81 research outputs found

An array content static analysis based on non-contiguous partitions

Author: Liu Jiangchao
Rival Xavier
Publication venue: 'Elsevier BV'
Publication date: 01/01/2017

Conventional array partitioning analyses split arrays into contiguous partitions to infer properties of sets of cells. Such analyses cannot group together non-contiguous cells, even when they have similar properties. In this paper, we propose an abstract domain that uses semantic properties to split array cells into groups. Cells with similar properties are packed into groups and abstracted together, and groups are not necessarily contiguous. This abstract domain makes it possible to infer complex array invariants fully automatically. Experiments on examples from the Minix 1.1 memory management and a tiny industrial operating system demonstrate the effectiveness of the analysis.

INRIA a CCSD electronic archive server
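
The idea in the abstract above, grouping array cells by a shared property rather than by position, can be illustrated on concrete data. The sketch below is only an illustration under stated assumptions, not the paper's abstract domain: the names `group_cells` and `summarize`, the "free"/"used" labels, and the sample table are all hypothetical.

```python
from collections import defaultdict

def group_cells(cells, classify):
    """Group array indices by the label that classify(value) returns.

    Groups may be non-contiguous: indices with the same label end up
    together regardless of where they sit in the array.
    """
    groups = defaultdict(list)
    for index, value in enumerate(cells):
        groups[classify(value)].append(index)
    return dict(groups)

def summarize(cells, groups):
    """Summarize each group by the (min, max) interval of its values."""
    return {
        label: (min(cells[i] for i in indices), max(cells[i] for i in indices))
        for label, indices in groups.items()
    }

# Hypothetical process table: 0 marks a free slot, a positive value is a pid.
table = [0, 17, 0, 42, 8, 0]
groups = group_cells(table, lambda v: "free" if v == 0 else "used")
summary = summarize(table, groups)
```

Here the "free" group covers indices 0, 2 and 5, which a partition into contiguous segments could not describe with a single segment.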

Unleashing Mask: Explore the Intrinsic Out-of-Distribution Detection Capability

Author: Han Bo
Li Hengzhuang
Liu Tongliang
Xu Jianliang
Yao Jiangchao
Zhu Jianing
Publication date: 06/06/2023

Out-of-distribution (OOD) detection is an indispensable aspect of secure AI when deploying machine learning models in real-world applications. Previous paradigms either explore better scoring functions or use knowledge of outliers to equip models with the ability to detect OOD data. However, few of them pay attention to the intrinsic OOD detection capability of the given model. In this work, we find that, across different settings, a model trained on in-distribution (ID) data passes through an intermediate stage with higher OOD detection performance than its final stage, and we further identify one critical data-level attribution to be learning with the atypical samples. Based on these insights, we propose a novel method, Unleashing Mask, which aims to restore the OOD discriminative capabilities of the well-trained model with ID data. Our method uses a mask to identify the memorized atypical samples, and then fine-tunes the model or prunes it with the introduced mask to forget them. Extensive experiments and analysis demonstrate the effectiveness of our method. The code is available at: https://github.com/tmlr-group/Unleashing-Mask.
Comment: accepted by ICML 202

arXiv.org e-Print Archive

Automatic Verification of Embedded System Code Manipulating Dynamic Structures Stored in Contiguous Regions

Author: Chen Liqian
Liu Jiangchao
Rival Xavier
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/11/2018

User-space programs rely on memory allocation primitives when they need to construct dynamic structures such as lists or trees. However, low-level OS kernel services and embedded device drivers typically avoid resorting to an external memory allocator in such cases, and store structure elements in contiguous arrays instead. This programming pattern leads to very complex code, based on data structures that can be viewed and accessed either as arrays or as chained dynamic structures. The correctness of the code then depends on intricate invariants mixing both aspects. We propose a static analysis that is able to verify such programs. It relies on the combination of abstractions of the allocator array and of the dynamic structures built inside it. This approach makes it possible to integrate the reasoning steps inherent in the array and in the chained structure into a single abstract interpretation. We report on the successful verification of several embedded OS kernel services and drivers.

INRIA a CCSD electronic archive server
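
The programming pattern described in the abstract above, a chained structure whose elements live in a contiguous array instead of coming from an allocator, can be sketched as follows. This is a minimal illustration of the general pattern, not code from the verified kernels or drivers; the class `ArrayList`, its fields, and the `NIL` sentinel are hypothetical names.

```python
NIL = -1  # sentinel index meaning "no next cell"

class ArrayList:
    """Singly linked list whose nodes live in a fixed-size array."""

    def __init__(self, capacity):
        self.value = [None] * capacity
        # Initially every cell is chained into the free list: 0 -> 1 -> ... -> NIL.
        self.next = [i + 1 for i in range(capacity)]
        if capacity > 0:
            self.next[capacity - 1] = NIL
        self.free = 0 if capacity > 0 else NIL  # head of the free list
        self.head = NIL                         # head of the used list

    def push(self, v):
        """Take a cell from the free list and link it at the head."""
        if self.free == NIL:
            raise MemoryError("no free cell")
        cell = self.free
        self.free = self.next[cell]
        self.value[cell] = v
        self.next[cell] = self.head
        self.head = cell
        return cell

    def pop(self):
        """Unlink the head cell and return it to the free list."""
        if self.head == NIL:
            raise IndexError("pop from empty list")
        cell = self.head
        self.head = self.next[cell]
        v = self.value[cell]
        self.value[cell] = None
        self.next[cell] = self.free
        self.free = cell
        return v

    def items(self):
        """Walk the chain from the head and collect the stored values."""
        out, cell = [], self.head
        while cell != NIL:
            out.append(self.value[cell])
            cell = self.next[cell]
        return out
```

Correctness here depends on both views at once: every index in `next` must stay within the array bounds (the array view), and the used and free chains must stay acyclic and disjoint (the chained-structure view).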

Diversified Outlier Exposure for Out-of-Distribution Detection via Informative Extrapolation

Author: Han Bo
Liu Tongliang
Niu Gang
Sugiyama Masashi
Yao Jiangchao
Yu Geng
Zhu Jianing
Publication date: 26/10/2023

Out-of-distribution (OOD) detection is important for deploying reliable machine learning models in real-world applications. Recent advances in outlier exposure have shown promising results on OOD detection by fine-tuning the model with informatively sampled auxiliary outliers. However, previous methods assume that the collected outliers are sufficiently numerous and representative to cover the boundary between ID and OOD data, which might be impractical and challenging. In this work, we propose a novel framework, Diversified Outlier Exposure (DivOE), for effective OOD detection via informative extrapolation based on the given auxiliary outliers. Specifically, DivOE introduces a new learning objective, which diversifies the auxiliary distribution by explicitly synthesizing more informative outliers for extrapolation during training. It leverages a multi-step optimization method to generate novel outliers beyond the original ones, and it is compatible with many variants of outlier exposure. Extensive experiments and analyses have been conducted to characterize and demonstrate the effectiveness of the proposed DivOE. The code is publicly available at: https://github.com/tmlr-group/DivOE.
Comment: accepted by NeurIPS 202

arXiv.org e-Print Archive

Regulation of COL1A2, AKT3 genes, and related signaling pathway in the pathology of congenital talipes equinovarus

Author: Haixiang Lv
Jiangchao Zhang
Ningqing Wang
Zhenjiang Liu
Publication venue: 'Frontiers Media SA'
Publication date: 01/07/2022

Congenital talipes equinovarus (CTEV) is one of the most common congenital limb defects in children. It is a multifactorial and complex disease that is associated with many unknown genetic, socio-demographic, and environmental risk factors. Emerging evidence indicates that gene expression or mutation might play an important role in the occurrence and development of CTEV. However, the underlying reasons and the mechanisms involved are still not clear. Herein, to probe the potential genes and related signaling pathways involved in CTEV, we first identified the differentially expressed genes (DEGs) by mRNA sequencing in pediatric patients with CTEV compared with normal children. COL1A2 was upregulated and AKT3 was downregulated at the transcriptional level. Western blot and quantitative polymerase chain reaction (qRT-PCR) results also showed that the expression of COL1A2 in CTEV was enhanced and that of AKT3 was decreased. Furthermore, COL1A2 knock-in (+COL1A2) and AKT3 knock-out (-AKT3) transgenic mice were used to verify the effects of these two genes in CTEV, and the results showed that both COL1A2 and AKT3 were closely related to CTEV. We also investigated the effect of the PI3K-AKT3 signaling pathway in CTEV by measuring the relative expression of several key genes using Western blot and qRT-PCR. In line with the Kyoto Encyclopedia of Genes and Genomes (KEGG) analysis data, the PI3K-AKT3 signaling pathway might play an important role in the regulation of pathological changes in CTEV. This study provides new ideas for investigating the mechanism of CTEV and for its prenatal diagnosis.

Directory of Open Access Journals

Shrubland biomass and root-shoot allocation along a climate gradient in China

Author: Jiangchao Guo
Ming Yue
Xiao Liu
Yaoxin Guo
Yongfu Chai
Publication venue: Meise Botanic Garden
Publication date: 01/01/2021

Background – Shrublands are receiving increasing attention because of climate change. However, knowledge about the biomass allocation of shrublands at the community level, and about how climate regulates it, is limited, although it is critical for accurately estimating carbon stocks and predicting global carbon cycles.
Methods – We sampled 50 typical shrublands along a climate gradient in China and investigated the biomass allocation of shrubland at the community level and the effect of climate on biomass allocation. Shrub biomass was estimated using species-specific allometric relationships, and the biomass of understory herbs was collected by excavating the whole plant. Regression analysis was used to examine the relationships between the biomass and the climate factors. RMA analyses were conducted to establish the allometric relationships between the root and the shoot biomass at the community level.
Key results – Shoot, root, and total biomass of shrub communities across different sites were estimated with median values of 206.5, 145.8, and 344.5 g/m², respectively. Shoot, root, and total biomass of herb communities were estimated at 68.2, 58.9, and 117.2 g/m², respectively. The median value of the R/S ratio of shrub communities was 0.58 and that of herb communities was 0.84. The R/S ratio of the shrub community showed a negative relationship with mean annual temperature and mean annual precipitation and a positive relationship with total annual sunshine and the aridity index. The R/S ratio of the herb community, however, showed only a weak relationship with climate factors. Shoot biomass of the shrub community was nearly proportional to root biomass with a scaling exponent of 1.17, whereas shoot biomass of the herb community was disproportional to root biomass with a scaling exponent of 2.1.
Conclusions – In shrublands, climate factors affected root biomass more than shoot biomass, which is related to water availability as a result of changes in the biomass allocation of the shrub community. The understory herb community was less affected by climate because the overstory–understory interaction modifies the climate-induced biomass allocation pattern. Shoot biomass of shrubs scales isometrically with root biomass at the community level, which supports the isometric theory of above-ground and below-ground biomass partitioning.

Directory of Open Access Journals

ARPHA OAI-PMH Endpoint

ARPHA Preprints
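
The scaling exponents quoted in the shrubland abstract above come from fitting an allometric power law, shoot = a * root**b, where b = 1 corresponds to isometric scaling. The sketch below shows one common way to estimate b, assuming that "RMA" refers to reduced major axis regression on log-transformed values and that shoot is treated as the dependent variable; neither detail is stated in the abstract. The function name and the sample numbers are made up for illustration and are not the study's data.

```python
import math
import statistics

def rma_scaling_exponent(root, shoot):
    """Estimate a and b in shoot = a * root**b by RMA regression on log-log data.

    The RMA slope is sign(r) * sd(log shoot) / sd(log root), where r is the
    Pearson correlation of the log-transformed values.
    """
    x = [math.log(v) for v in root]
    y = [math.log(v) for v in shoot]
    mean_x, mean_y = statistics.fmean(x), statistics.fmean(y)
    # The sign of the covariance equals the sign of the correlation r.
    covariance = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    sign = 1.0 if covariance >= 0 else -1.0
    slope = sign * statistics.stdev(y) / statistics.stdev(x)
    intercept = mean_y - slope * mean_x
    return slope, math.exp(intercept)

# Synthetic, noise-free values (g/m^2) generated from an exact power law.
root = [10.0, 20.0, 40.0, 80.0]
shoot = [2.0 * r ** 1.17 for r in root]
exponent, coefficient = rma_scaling_exponent(root, shoot)
```

On noise-free data such as this, the estimate recovers the exponent and coefficient used to generate it; on field data, RMA and ordinary least squares generally give different slopes.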

Exploring Model Dynamics for Accumulative Poisoning Discovery

Author: Du Chao
Guo Xiawei
Han Bo
He Li
Liu Tongliang
Wang Liang
Yao Jiangchao
Yuan Shuo
Zhu Jianing
Publication date: 06/06/2023

Adversarial poisoning attacks pose huge threats to various machine learning applications. In particular, recent accumulative poisoning attacks show that it is possible to cause irreparable harm to models via a sequence of imperceptible attacks followed by a trigger batch. Due to the limited data-level discrepancy in real-time data streaming, current defensive methods are indiscriminate in handling poison and clean samples. In this paper, we take the perspective of model dynamics and propose a novel information measure, Memorization Discrepancy, to explore defenses based on model-level information. By implicitly transferring changes in the data manipulation to changes in the model outputs, Memorization Discrepancy can discover imperceptible poison samples based on how their dynamics differ from those of clean samples. We thoroughly explore its properties and propose Discrepancy-aware Sample Correction (DSC) to defend against accumulative poisoning attacks. Extensive experiments comprehensively characterize Memorization Discrepancy and verify its effectiveness. The code is publicly available at: https://github.com/tmlr-group/Memorization-Discrepancy.
Comment: accepted by ICML 202

arXiv.org e-Print Archive

Combating Bilateral Edge Noise for Robust Link Prediction

Author: Guo Xiawei
Han Bo
He Li
Liu Jiaxu
Wang Liang
Yao Jiangchao
Yao Quanming
Zheng Bo
Zhou Zhanke
Publication date: 02/11/2023

Although link prediction on graphs has achieved great success with the development of graph neural networks (GNNs), its robustness under edge noise remains under-investigated. To close this gap, we first conduct an empirical study showing that edge noise bilaterally perturbs both the input topology and the target labels, yielding severe performance degradation and representation collapse. To address this dilemma, we propose an information-theory-guided principle, Robust Graph Information Bottleneck (RGIB), to extract reliable supervision signals and avoid representation collapse. Unlike the basic information bottleneck, RGIB further decouples and balances the mutual dependence among graph topology, target labels, and representation, building new learning objectives for representations that are robust against the bilateral noise. Two instantiations, RGIB-SSL and RGIB-REP, are explored to leverage the merits of different methodologies, i.e., self-supervised learning and data reparameterization, for implicit and explicit data denoising, respectively. Extensive experiments on six datasets and three GNNs with diverse noisy scenarios verify the effectiveness of our RGIB instantiations. The code is publicly available at: https://github.com/tmlr-group/RGIB.
Comment: Accepted by NeurIPS 202

arXiv.org e-Print Archive